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PROCESS AND DEVICE FOR THE SUPPRESSION OF NOISE AND 
ECHOES 

Assignee: ALCATEL 

OurRef: Q67428 



The following documents and fees are submitted herewith in connection with the above 
application for the purpose of entering the National stage under 35 U.S.C. § 371 and in 
accordance with Chapter II of the Patent Cooperation Treaty: 

□ an executed Declaration and Power of Attorney. 

0 an English translation of the International Application. 
0 1 sheet of drawings. 

□ an English translation of Article 19 claim amendments. 

0 an English translation of Article 34 amendments (annexes to the IPER). 

□ an executed Assignment and PTO 1595 form. 

□ a Form PTO-1449 listing the ISR references, and a complete copy of each reference. 
13 a Preliminary Amendment 

The Declaration and Power of Attorney, Assignment, Form PTO-1449 listing the 
International Search Report (ISR) references and a complete copy of each reference will be 
submitted at a later date. 

It is assumed that copies of the International Application, the International Search Report, 
the International Preliminary Examination Report, and any Articles 19 and 34 amendments as 
required by § 371 (c) will be supplied directly by the International Bureau, but if further copies 
are needed, the undersigned can easily provide them upon request. 
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PLEASE SEE THE ATTACHED PRELIMINARY AMENDMENT BEFORE 
CALCULATING THE FEE. CLAIM CALCULATIONS ARE BASED ON THE 
ARTICLE 34 AMENDMENTS MADE TO THE CLAIMS 

The Government filing fee is calculated as follows: 

Total claims 12_ - 20 = x $18.00 $.00 

Independent claims 1_ - 3 = x $84.00 $-00 

Base Fee $890.00 

TOTAL FEE $890.00 



A check for the statutory filing fee of $890.00 is attached. You are also directed and 
authorized to charge or credit any difference or overpayment to Deposit Account No. 19-4880. 
The Commissioner is hereby authorized to charge any fees under 37 C.F.R. §§ 1.16, 1.17 and 
1.492 which may be required during the entire pendency of the application to Deposit Account 
No. 19-4880. A duplicate copy of this transmittal letter is attached. 
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PATENT APPLICATION 
IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of PCT/EP00/04803 
Rainer MARTIN Attorney Docket Q67428 

Appln.No.: Not Assigned 

Confirmation No. : Not Assigned Group Art Unit: Not Assigned 

Filed: November 30, 2001 Examiner: Not Assigned 

For: PROCESS AND DEVICE FOR THE SUPPRESSION OF NOISE AND ECHOES 
PRELIMINARY AMENDMENT 

Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

Prior to examination, please amend the above-identified application as follows: 

IN THE CLAIMS : 

Please enter the following amended claims: 

7. (Amended) Process according to claim 1, characterised in that the signal 
estimators (Si, S 2 , S N ) are calculated in a transformed domain, for example into a Fourier or 
discrete cosine transformation domain. 
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Preliminary Amendment 
Attorney Docket Q67428 

REMARKS 

Entry and consideration of this Amendment is respectfully requested. 



Respectfully submitted, 



SUGHRUE MION, PLLC 
2100 Pennsylvania Avenue, N.W. 
Washington, D.C. 20037-3213 
Telephone: (202) 293-7060 
Facsimile: (202) 293-7860 

Date: November 30, 2001 
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Preliminary Amendment 
Attorney Docket Q67428 

APPENDIX 

VERSION WTTH MARKINGS TO SHOW CHANGES MADE 

IN THE SPECIFICATION ; 

The specification is changed as follows: 

IN THE CLAIMS : 

The claims are amended as follows: 

7 . Process according to enc of the preceding Claimsdadnil, characterised in that 
the signal estimators (Si, S 2 , S N ) are calculated in a transformed domain, for example into a 
Fourier or discrete cosine transformation domain. 



Process and device for the suppression of noise and echoes 



The invention concerns a process for noise and echo 
suppression in signals, for example video or speech 
signals. The invention can be used for example in a 
telephone facility containing a handsfree facility, in a 
videophone or a medical imaging device. The invention 
furthermore concerns a device for implementing this 
process . 

During the recording and transmission of video or speech 
signals, these signals are often distorted by noise and, in 
the case of transmission over lines, also frequently 
distorted by (line) echoes. Furthermore, in the case of 
speech signals and the use of a handsfree facility, 
acoustic echoes also occur. In order to improve the 
subjective quality and the audibility of speech or the 
sharpness and contrast of images, these disturbances can be 
reduced by background noise reduction and, if necessary, by 
an echo reduction device, for example an echo compensator 
or a level balancer. 

The background noise reduction is usually carried out by 
means of a spectral weighting method into a transformed 
domain, for example according to a Fourier transformation 
or a discrete cosine transformation. Methods for the 
improvement of noisy speech signals are described, for 
example, in Y. Ephraim and D. Malah, 'Speech Enhancement 
Using a Minimum Mean- Square Error Short -Time Spectral 
Amplitude Estimator', IEEE Trans. Acoustics, Speech and 
Signal Processing, Vol. 32, pp. 1109-1121, 1984, and in 
D. Malah, R.V. Cox and A.J. Accardi , 'Tracking Speech- 
Presence Uncertainty to Improve Speech Enhancement in Non- 
Stationary Noise Environments', Proc . IEEE Intl. Conf . 
Acoustics, Speech, Signal Processing (ICASSP) , 1999. 
Similar methods for the reduction of noise in image signals 
are known; see, for example, J.S. Lim, 'Image Restoration 
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by Short Space Spectral Subtraction', IEEE Trans. Acoustic, 
Speech and Signal Proc . , Vol. 28, pp 191-197, 1980 or T. 
Aach and D. Kunz, Spectral Estimation Filters for Noise 
Reduction in X-ray Fluoroscopy Imaging', Proc. EUSIPCO, pp. 
5 571-574, 1996. 

The reduction of acoustic echoes or line echoes is mainly 
carried out with an echo compensator, a level balancer, a 
centre clipper or any combinations of this method. An 

10 overview of the prior art relating to speech signals is 
given, for example, in E. Hansler, 'The Hands -Free 
Telephone Problem - An Annotated Bibliography', Signal 
Processing, Vol. 27, pp. 259-271, 1992, and in E. Hansler, 
'The Hands -Free Telephone Problem - An Annotated 

15 Bibliography Update", Annales des Telecommunication, 
Vol. 49, No. 7-8, pp. 360-367, 1994. 

Methods which reduce noise and echoes together have also 
been known for some time. Earlier methods of this type 

2 0 consist of a simple series circuit of a background noise 

reduction device and an echo compensator, as described, for 
example, in U.S. Patent 5,680,393 (Bourmeyster et al . ) . 
Improved methods combine the background noise and echo 
reduction in the context of a true synergy. Here a single 

25 filter is used for simultaneous reduction of background 
noise and the residual echoes remaining after echo 
compensation. Increased echo attenuation and a complexity 
advantage is achieved as a result. Such a method is 
described, for example, in S. Gustafsson et al . : Combined 

30 Acoustic Echo Control and Noise Reduction for Hands-free 
Telephony, Signal Processing, vol. 64, pp. 21-32, 1998. 
These methods generally consist of the combination of an 
echo compensator with a filter reducing the residual echoes 
and the noise. 

35 

The disadvantage of the previously known methods is that 
the methods for common noise and residual echo suppression 
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use only one single filter suppressing the residual echoes 
and the noise. Such a method is therefore inadequate when 
used for changing requirements and varying signal 
characteristics. For example, different operating 
5 conditions are possible during the use of a hands- free 

facility, depending on whether only the local subscriber is 
speaking or only the remote subscriber is speaking, or both 
subscribers are speaking. Since in these operating 
conditions the requirements of the echo attenuation, for 

10 example, are different, it appears to be useful to use 

different filters adapted to the operating conditions and 
to change these over according to the operating condition 
or to use them in a suitably weighted sum. Distortion of 
the useful signal (for example speech) can therefore be 

15 reduced, or even the quality of the remaining background 
noise improved. In image processing, improvements are thus 
possible with regard to disturbing block effects, loss of 
contrast or even the effects of unnatural background 
patterns ('gratings'). 

20 

The object of the invention is therefore to specify a 
process and a device that facilitates effective noise 
and/or echo reduction, and in so doing causes only minimal 
distortion in the useful signal and leads to high quality 
25 in the residual interference signal, where different signal 
filters are used in a weighted sum. 

This object is achieved by a process that has the features 
of Claim 1. 

30 

The process is based on the fact that the distorted signal 
does not contain the useful signal, noise and any echo 
signals at all points in time or in all picture elements or 
image regions and therefore the use of different signal 
35 estimators or signal filters at different instants or for 
different picture elements is an advantage. For example, 
the useful signal is then contained only in the distorted 



4 



signal when video information or speech is actually being 
transmitted. Similarly, the occurrence of echoes depends on 
the activity of the signal sources generating the echo. 

5 According to the invention, different signal estimators or 
signal filters are now used for the various cases, for 
example, 'video or speech information present in the 
signal', 'noise present in the signal',' echoes present in 
the signal ' , and any combination of these and according to 

10 the level of the signals relative to each other - any 

linear or nonlinear combinations of these signal estimators 
or signal filters also being possible. The useful signal is 
extracted in this way from the distorted signal by one of 
the estimators or filters adapted to the respective 

15 situation. The changing-over of these estimators or filters 
or their relative weighting in a combination is determined 
with the aid of variables which are determined from the 
distorted signal or from other signal sources. 

2 0 Fig. 1 shows the principal arrangement, z (k) denotes a 

speech or video signal and §(k) the signal free of noise 
interference or echoes, x(k) denotes any existing external 
control signal . The distorted signal z (k) is now fed to N 
signal estimators or signal filters S l7 S 2/ S N . A 

25 combination unit K which processes the input signal z (k) 

and any existing external control signal x(k), controls the 
selection and/or the combination of the signals delivered 
by the signal estimators or signal filters S lf S 2 , S N 
and forms the noise-free and echo-free signal s(k). 

30 

Special statistical methods are suitable for determining 
the estimators and the relative weighting of these 
estimators to each other. In particular, the selection of 
one or more suitable signal estimators or signal filters at 
35 a specific instant or in a specific picture element can be 
controlled from the estimated probabilities of the presence 
of the useful signal, noise or echo signal component in the 



5 



distorted signal . An exemplary embodiment of such an 
estimation method for use in a hands-free facility is 
specified below. 

5 Fig. 2 shows the combination comprising an echo compensator 
and an adaptive post filter. The post filter is employed in 
the frequency domain, using a discrete Fourier 
transformation (DFT) , a spectral weighting and an 
'overlap/add' signal synthesis. The post filter is 
10 comprised of two part filters and an adaptive combination 
unit, as shown below. 

We consider the sampled and band-limited signals x(i), 
y(i), z(i) and §(i), where i denotes the discrete time 

15 index. x(i) is the signal of the remote speaker and y(i) 

the microphone signal which is comprised of a speech signal 
s(i), a noise signal n(i) and an echo signal e(i), y(i) = 
s(i) + n(i) + e(i). The echo -compensated signal z(i) is the 
microphone signal minus the echo e(i), z(i) = y(i) - e(i) = 

20 s(i) + n(i) + e(i) estimated by the compensator. e(i) 

denotes the residual echo after compensation. We further 
assume that the signals s(i), x(i) and n(i) are 
statistically independent. The compensated signal z(i) 
distorted by noise and residual echoes is transformed into 

25 the frequency domain by the use of a window function h(i) , 
by assembling a frame of L successive samples of z(i), 
weighting this frame with the window function and 
calculating a DFT of length L. The window is displaced by R 
samples on the input signal prior to the next DFT 

3 0 calculation. The DFT analysis with a sliding window results 
in a quantity of frequency domain signals which can also be 
specified with 

L-l 

Z{\ k) = J2 Z ( XR - riHrie- j2 * k * /L (1) 

35 
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where A specifies an under-sampled time domain index, A s 
Z, and k a frequency index, k £ {0,1..., L-l}, and the 
latter stands in relationship to the standardised centre 
frequency of the DFT bands, where Q k by ^k = k2n/L. 
5 Typically, a sampling rate of / A = 8000 Hz and a DFT length 
of L = 2R = 256 is used. 



Likewise, the Fourier coefficients of all other signals of 
the kth frequency index are specified by 

• S k = A k exp(ja k ) (undistorted local speech), 

• Y k = R k exp(j' k ) (distorted local speech), 

• X k = B k exp(jP k ) (remote speech), 

• Z k = D k exp(jC k ) (compensated signal) 

• S k = A k exp(j« k ) (interference- suppressed signal) 

where the time index A has been suppressed for the sake of 
clarity. 



2 0 The distribution density functions of the local and remote 
speech can be specified by p s (A k ,« k ) and p x (B k ,P k ), 
respectively : 



P.{Ak> a k ) = P(Hl k )p 3 {Ak, * k \Hl) + P{H° sk )S{A ky a k ) (2) 
P*{B k ,0 k ) = P{Hl k ) Px {B k ,p k \Hl) + P{H° xk )5(B k , ft) (3) 

where P (H x sk ) and P(H 1 xk ) stand for the probabilities that 
local or remote speech is present and P(H° sk ) = 1 - P (H 1 ^) ) 
and P(H° X]C ) = 1 - P(H 1 xk ) applies. 5(.) denotes the Dirac 
function. 
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The cost function 




+ P{Hl)P{Hl) jjjj{A-AYp{Z\A,a,B,P) 

■ p 9 {A, a\H l a ) Px {B, P\H l x )dAdadBdp 
+ P{H 0 s )P{H l x )A?p{Z\H° a , Hi) 

+ P{H])P(Hl) J J {A -A? 

■ p(Z\A, a, H°)p s {A, a\H])dAd<x}dZ 



5 is now minimised, where A denotes the spectral amplitudes 
of the estimated local signal , which in turn are a function 
of Z. The solution is given by 



~ _ P{H})P{Hl)p(Z\HlHl) 
A _ 

, P{H))P{H Q x )p{Z\H],H») , 
+ Pz ' 
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where 



5! = E{A\Z,H 9 \H X } 



(6) 
(7) 



S 2 = E{A\Z,H 1 S ,H X '} 



P s = P(H 0 s )P{H 0 x )p{Z\HlH° x ) 
+ P{H])P{Hl)p{Z\Hl,Hl) 
+ P{H°)P{H x )p{Z\H° s ,Hl) 
+ P{Hl)P{H x >)p{Z\HlH x >) 



and 




(12) 



(10) 



(11) 



0) 



5 E{.} denotes the expected value and E{ . | . } a conditional 
expected value. The conditional expected value is the best 
estimated value in the context of the mean square error. 

p n , P d/ Y n/ Y d , £ n , and ^ d are the power density spectra of 
10 the interference, the respective a posteriori SNR, and the 
a priori SNR for the case where only the local speaker is 
active (index n) or both speakers are active (index d) . In 
this case the interference-suppressed signal is therefore 
formed from a linear combination of two signal estimators S a 
15 and S 2/ the estimator S x being predominantly used when the 
local and remote speakers are active, and the estimator S 2 
being predominantly used when only the local speaker is 
active. The process described here, using several signal 
estimators, distinguishes itself from the prior art in that 
2 0 it produces only very small distortions in the useful 
signal, with completely naturally- sounding residual 
background noise. 
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Process for the reduction of noise and echoes in a 
useful signal (z(k)) 

characterised in that 

the useful signal (z(k)) affected by noise and echoes 
is fed to several signal estimators {S 1 , S 2 , .... S N ) , 
that are optimised for different signals, namely for 
the useful signal or noise or echoes, or different 
signal combinations, namely for a combination 
comprising useful signal and noise and echoes, or for 
a combination comprising useful signal and noise, or 
for a combination comprising echo and noise, and that 
the output signals of the signal estimators (S l7 S 2 , 
. . . , S N ) are combined in a combination unit (K) in 
relation to the probability relationships of the 
useful signal, of the noise and of the echo in the 
useful signal (z(k)) affected by the noise and echoes, 
determined according to 



Z J + p 1 ^2 



25 so that a noise-reduced and echo-reduced output signal 

(s(k)) is delivered by the combination unit (K) . 

2. Process according to Claim 1, characterised in that 

the useful signal (z(k)) affected by noise and echoes 
30 is a speech signal affected by noise and echoes. 

3. Process according to Claim 1, characterised in that 

the useful signal (z(k)) affected by noise and echoes 
is a video signal affected by noise and echoes. 
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Process according to Claim 1, characterised in that 

the echoes are acoustic echoes . 

Process according to Claim 1, characterised in that 

the echoes are line echoes. 

Process according to Claim 1, characterised in that 

the echoes are reduced by an echo compensator (C) and 
the residual echoes and the noise are reduced by 
downstream signal estimators (S lf S 2 , S N ) . 

Process according to one of the preceding Claims, 
characterised in that the signal estimators (S x , S 2 , 
. . . , S N ) are calculated in a transformed domain, for 
example into a Fourier or discrete cosine 
transformation domain. 

Process according to Claim 1, characterised in that 

the signal estimators (S^ S 2 , S N ) are calculated 

according to the principle of least error squares. 

Process according to Claim 1, characterised in that 

only the amplitudes of the signals in the transformed 
domain are taken into account by the signal estimators 
(S 1# S 2 , . . . , S N ) . 

Process according to Claim 1, characterised in that a 

cost function is minimised by the signal estimators 
(S 1# S 2 , S K ) jointly. 

Process according to Claim 1, characterised in that 
the output signals of the signal estimators (S a , S 2 , 
. . . , S N ) are weighted and averaged and the weights are 
calculated from the probabilities of the presence of 
the useful signal or of the noise or of the echoes or 
from the combinations comprising useful signal, noise 
and echoes . 
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12. Process according to Claim 1, characterised in that 

acoustic echoes and noise are simultaneously 
suppressed and the combination comprising the output 
5 signals of the signal estimators (S 1# S 2 , . . . , S N ) is 

dependent upon the activity of the local and of the 
remote speaker of a handsfree facility, and the 
combination unit (K) is additionally controlled via a 
control signal (x(k)) that is dependent upon the 
10 remote speaker. 



Process and device for the suppression of noise and echoes 
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Fig. 1: Device for the reduction of noise and echo 
interference by means of N signal estimators or signal 
filters and a combination unit K. 
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Fig. 2: Device for the joint reduction of acoustic echoes 
and noise. 
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As a below named inventor, I hereby declare that my residence, mailing address and citizenship are as stated below next to my 
name: that I verily believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint 
inventor (if plural names are listed below) of the subject matter claimed and for which a patent is sought in the application 
entitled: "Process and device for the suppression of noise and echoes" 
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